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NORM INEQUALITIES RELATED TO THE MATRIX GEOMETRIC MEAN 


RAJENDRA BHATIA AND PRIYANKA GROVER 


Abstract. Inequalities for norms of different versions of the geometric mean of two posi¬ 
tive definite matrices are presented. 


1. Introduction 


The geometric mean of positive numbers a and b is the number \fab, and it satisfies the 
equations 

Vab = e^ (log “ +logi) = lint ( aP + bP \ . (1) 

p^o\ 2 j 

The quantity 


f(p) = 


+ b p \ 


i/p 




-oo < p < oo, 


( 2 ) 


is called the binomial mean, or the power mean, and is an increasing function of p on 

(—oo, oo). 

Replacing a and b by positive definite matrices A and B, let 

(A p + B p \ 1/p 


F(p) — 


V 2 


- 


(3) 


In []7l] Bhagwat and Subramanian showed that 


lint F(p) = e hbogA+\o g B)' 

p— 


(4) 


They also showed that the matrix function F(p) is monotone with respect to p, on the 
intervals (—oo, —1] and [1, oo) but not on (—1,1). (The order X < Y on the space P of 
nxn positive definite matrices is defined to mean Y — X is a positive semidefinite matrix.) 

The entity in d4]) is called the “log Euclidean mean” of A and B. However it has some 
drawbacks, and the accepted definition of the geometric mean of A and B is 

A# l/2 B = A 1 ' 2 (A -1/2 5A -1/2 ) 1/2 A l/2 . (5) 
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It is of interest to have various comparisons between the quantities in (©, ([4]) and d5]), 
and that is the question discussed in this note. 

Generalising © various authors have considered for 0 < t < 1 

A# t B = A 1 ' 2 (A-^BA- 1 ' 2 ) 1 A 1 ' 2 , (6) 

and called it t-geometric mean, or t-power mean. In recent years there has been added 
interest in this object because of its connections with Riemannian geometry The space 
P has a natural Riemannian metric, with respect to which there is a unique geodesic joining 
any two points A, B of P. This geodesic can be parametrised as ©. 

The linear path 

(1 -t)A + tB, 0<t<l, (7) 

is another path in P joining A and B. It is well known Exercise 6.5.6] that 

A# t B < (1 — t)A + tB for all 0 < t < 1. (8) 

The special case t — 1/2 of this is the matrix arithmetic-geometric mean inequality, first 
proved by Ando |Tj]. 

For 0 < t < 1 let 

F t (p) = ({l-t)A* + tB*) 1/p . (9) 

For t — 1/2 this is the F defined in ©. It follows from the work in [jT]] that 

Urn FAp) = e (i-i)iogA+tio g S ; (10) 

p —>o 

and that F t (p) is monotone with respect to p on (— oo, —1] and [1, oo) but not on (—1,1). We 
denote by Xj (X ) , 1 < j < n, the decreasingly ordered eigenvalues of a Hermitian matrix 
X, and by HI • HI any unitarily invariant norm on the space Mofnxn matrices. Our first 
observation is that while the matrix function F t (p) defined in ([9]) is not monotone on the 
whole line (—oo, oo), the real functions A j(F t (p)) are: 

Theorem 1. Given positive definite matrices A and B, let F t (p) be as defined in Q. Then for 
1 < j <n the function A j(F t (p)) is an increasing function of p on (— oo, oo). 

As a corollary |||F t (p)||| is an increasing function of p on (—oo, oo). In contrast to this, 
Hiai and Zhan HT8l1 have shown that the function ||| ( A p + B v ') 1 ^ v ||| is decreasing on (0,1] 
(but not necessarily so on (1, oo)). A several variable version of both our Theorem []] and 
this result of Hiai and Zhan can be established (see Remark 1). 
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Combining Theorem [T] with a result of Ando and Hiai fl3D we obtain a comparison of 
norms of the means ([3]), (@1), (J5|), and their t -generalisations: 

Corollary 2. Let A and B be two positive definite matrices. Then for p > 0 

\\\A# t B\\\ < lllea-tJiogA+tiogflin < in ((i -t)A p + tB p ) 1/p \\\. (11) 

The first inequality in (fill) is proved in [J3]] as a complement to the famous Golden- 
Thompson inequality: for Hermitian matrices H, K we have 11|e if+ft, |11 < |||eV|||. Stronger 
versions of this inequality due to Araki [|2] and Ando-Hiai U3J can be used to obtain a re¬ 
finement of (fTll) . We have for 0 < t < 1 

\\\A# t B\\\ < ||| e d-h logj4+i logB||| 

< \\\{B^ A^- t)p B^) 1/p \\\ 

< \\\((l-t)A p + tB p ) 1/p \\\. (12) 

We draw special attention to the case p = 1 for which further refinements are possible. 


Theorem 3. Let A and B be positive definite matrices. Then 


|||-4#uB||| < HleM^+tiogBi 


< 

< 

< 


\\\B^ A 1 - 1 B^\\\ 

I (^A 1 ~ t B t + B t A 1 ~ t ) 

|||A 1 - i 5 i ||| 


< |||(1 — t)A + tB\\\. 


(13) 


For convenience we have stated these results as inequalities for unitarily invariant norms. 
Many of these inequalities have stronger versions (with log majorisations instead of weak 
majorisations). This is explained along with the proofs in Section 2. For the special case 
t — 1/2 we provide an alternative special proof for a part of Theorem [3j and supplement it 
with other inequalities. Section 3 contains remarks and comparisons with known results, 
some of which are very recent. 
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2. Proofs 

Proof of Theorem [Tj Let 0 < p < p'. Then the map f(t) = t p ! p ' on [0,oo) is matrix 
concave; see [|8l Chapter V]. Hence 

(1 - t)A p + tB p < ((1 - t)A p ' + tB p 'y /p '. 

This implies that 

A j ((1 - t)A p + tB p ) < Xj ((1 - t)A p ' + tB p ') p/p '. 

Taking pth roots of both sides, we obtain 

Xj ((1 - t)A p + tB p ) 1/p < Xj ((1 - t)A p ' + tB p ') 1/p '. (14) 

Next consider the case p < p' < 0. Then 0 < p'/p < 1. Arguing as above we obtain 

Xj ((1 - t)A p ' + tB p ') < Xj ((1 - t)A p + tB p ) p ' /p . 

Take p'th roots of both sides. Since p' < 0, the inequality is reversed and we get the 
inequality (fl4l) in this case too. Now let p be any positive real number. Using the matrix 
convexity of the function f(t) = f -1 we see that 

((1 - t)A~ p + tB~ P Y 1 < (1 - t)A p + tB p . 

From this we get an inequality for the jth eigenvalues, and then for their pth roots; i.e., 

Xj ((1 - t)A~ p + tB- p )~ 1/p < X 3 ((1 - t)A p + tB p ) l/p . 

It follows from the above cases that for any p < 0 < p' 

Xj ((1 - t)A p + tB p ) 1/p < Xj ((1 - t)A p ' + tB p ') W . (15) 

Taking limit as p' —> 0 and using (fTOl) we get 

Xj ((1 - t)A p + tB p ) 1/p < Xj (gd-biogA+tiogs^ 

i.e., for any p < 0 we have Xj (. F(p )) < Xj (F( 0)). For the case p > 0 a similar argument 
shows that X 3 (. F(p )) > X 3 (F(0)). ■ 

Proof of Theorem [3] The first inequality in (1T3T) follows from a more general result of 
Ando and Hiai D3j]. They showed that for Hermitian matrices H and K, \ \ \ (e pH # t e pK ^j ^ \ \ \ 
increases to \\\e ! - ] ^ H+tK \\\ as p f 0. Choosing H = log A, K = log B, and p — 1, we 
obtain the first inequality in (Il3l) . The Golden-Thompson inequality generalised to all 
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unitarily invariant norms (see fl8l p. 261] says that |||e H+x ||| < \\\e K ^ 2 e H e K ^ 2 \\\. Using 
this we obtain the second inequality in (fT3l) . (We remark here that it was shown in [fTOTl 
that the generalised Golden Thompson inequality follows from a generalised exponential 
metric increasing property. The latter is related to the metric geometry of the manifold 
P. So its use in the present context seems natural.) Given a matrix X we denote by ReX 
the matrix |(X + X*). By Proposition IX. 1.2 in |8|] if a product XY is Hermitian, then 
|||XY||| < ||| Re(YX)|||. Using this we obtain the third inequality in (fT3l) . The fourth 
inequality follows from the general fact 111 ReX| 11 < 11 |X| 11 for all X. The last inequality in 
(fT3l) is a consequence of the matrix Young inequality proved by T. Ando [{2j. ■ 

For Hermitian matrices H, K let Ai (H) > ■ ■ ■ > A n (H) and A] (K) > • • • > X n (K) be the 
eigenvalues of H and K respectively. Then the weak majorisation A (H) -< w A (K) means 
that 

k k 

k=l,2,...,n. 

i =1 i=1 

If in addition for k = n there is equality here, then we say A (H) -< A (K). For A, B > 0 we 
write 


A(A) ^ Iog A (B) 


if 

k k 

<11^(5), k — 1 ,... ,n — 1 (16) 

i=l i=l 

and 


n ^(-4) = n a i(B ), that is det A = det B. 

1=1 1=1 

We refer to it as log majorisation. We say A is weakly log majorised by B, in symbols 
A(A) -< w io g A (B), if d2]) is fulfilled. It is known that 


A(A) 

^Ywlog A (B) implies A(A) A (B), 

so that |||A||| < 11|R|11 for any unitarily invariant norm. (See Q8I] for facts on majorisation 
used here.) 
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There are stronger versions of some of the inequalities in (fl2l) . We have for p > 0 

A (A# t B) -<i og A( e (1 - t)logA+ * logB ) 

^ log A ( B tp / 2 A {1 - t)p B tp/2 y /p 
= A ( A {1 ~ t)p B tp ) 1/p 

A wlog A((l -t)A* + tB*) 1/p . (17) 

The first inequality is a result by Ando and Hiai 030. The second inequality follows from a 
result by Araki 060. The last inequality above follows from the matrix version of Young’s 
inequality by Ando 020. 

A further strengthening of the first inequality in (IT71) replacing log majorisation by point- 
wise domination is not possible. For t — 1/2 this would have said 

A j(A# 1/2 B) < Xj (e 12 ^ 2 ^) . 

r 2 o 

This is refuted by the example A = ^ 

A 2 (7L#i/ 2-B) = 1 and \ 2 (e'^£ 2 ^~) s=s 0.9806. 

The case t — 1/2, p — lis special. Following an idea of Lee 0200 we present a different 
proof of the majorisation 

A (A# 1/2 b) A log A (B^A^B 1 / 4 ) . (18) 

The geometric mean A#i/ 2 B satisfies the equation A#i/ 2 B = AA^UB 1 ! 2 for some unitary 
U. See 08} p.109]. Therefore for the operator norm || • || we have 

\\A#i,2B\\ = \\A 1/2 UB 1/2 \\ 

= \\a 1/4 a 1/4 ub 1/4 b 1/4 \\ 

< \\a 1/4 ub 1/4 b 1/4 a 1/4 \\ 

< \\A 1/4 UB 1/4 \W\B 1/4 A 1/4 \\. (19) 

Here the first inequality is a consequence of the fact that if XY is Hermitian, then ||AfY|| < 
|| YX ||. Next note that 

||A 1/4 (/5 1/4 || 2 = \\A 1/4 UB 1/2 U*A 1/4 \\ 

< \\a 1/2 ub 1/2 u*\\ 

= \\A 1/2 UB 1/2 \\ = ||A# 1/2j B||. (20) 


B = 


3 

3 


3 

9/2 


. A calculation shows that 
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Again, to derive the inequality above we have used the fact that ||XY|| < ||YX|| if XY is 
Hermitian. From (fT9l) and (1201) we see that 

||A#i /2 5|| 1/2 <||5 1/4 A 1 / 4 ||, 

and hence 

\\A# 1/2 B\\ < WB^A^r = WB^A^B^l (21) 

This is the same as saying that 

A, (A# 1/2 £) < A, (b^A^B 1 ' 4 ) . (22) 

If A fc (-A), 1 < k < n, denotes the fcth antisymmetric tensor power of X, then 

A fc (A# 1/2 B) = A fc (A )# 1/2 A fc ( B ). 

So from (l22l) we obtain 

Ai (A fc (A# 1/2 B)) < Ai (A k (B) 1 ' 4 A k (A) 1 / 2 A k {B) 1 ' 4 ) . 

This is the same as saying 

n (^# 1/2 b^ < ]/[ A j (^B 1 / 4 A 1 / 2 B 1 / 4 'j , 1 < k < n. (23) 

3 =1 3 =1 

For k — n there is equality here because 

det (A# 1/2 B) = det (A iI2 B 1 / 2 ) . 

From (l23l) we have the corollary 

A (A# 1/2 S) A w A (b^A^B 1 ' 4 ) . (24) 

Included in this is the trace inequality 

tr (a# 1/2 b) < tr A 1/2 B 1/2 . 

This has been noted in Il2()il . 

3. Remarks 

1. Let Ai,..., A m be positive definite matrices and let an ,a m > 0 be such that 
oij = 1. Let 

F( P ) = (a 1 A p + --- + a m A p n ) 1/p . (25) 

Then by the same argument as in the proof of Theorem HI A j{F(p)) is increas¬ 
ing in p on (— 00 , 00 ). In particular for a\ = ••• = a m = 1/m the function 
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A j (( Al+ ' m +ylm ) ; j is an increasing function of p on (— oo, oo). Therefore 


f A P+-+AP 1 \ 1/p 




is an increasing function of p on (— oo, oo). In contrast, it can be 


shown that 


(A P 1 + --- + A P J 


i/p 


is a decreasing function of p on (0,1]. For rri — 2 
Hiai and Zhan have shown this using the following result of Ando and Zhan 05]]. 
For positive operators A, B and r > 1 


||p + B) r ||| > |||A r + J B r |||. 

A several variable version of this follows from [14| Theorem 5 (ii)] of Bhatia and 
Kittaneh: 


is a de- 


|||(A + • • • + A m ) r ||| > |||A] +-h A r m HI for r > 1. 

By imitating the argument in HT8T] one can show | (A^ +-h A p rn ) 1 ^ p 

creasing function of p on (0,1]. 

2. In [0] Bhagwat and Subramanian showed that for positive definite matrices Ai,...,A r 
and «i,.. •, a m > 0 such that J2 c*j — 1 

lim ( ai A{ + • • • + a m A p m ) 1/p = e ai iogA 1+ -+a m io g A m ^ 

p—> 0 

It follows from Remark 1 that 

| | | gQ! log Al 4 VOLm log Am | | | <||| ( aiA p + ... + a m A p m ) 1/p III for p > 0. 


3. Recently several versions of geometric mean for more than two positive definite 
matrices have been considered by various authors. (See H, Ifl2ll and I1T611 .') For 
positive definite matrices Ai,..., A rn let G(Ai ,..., A m ) denote any of these geo¬ 
metric means. Our discussion in Corollary 2 and Remark 2 raises the question 
whether 


III G(A 1 ,...,A r 


< 


log Aj H-Flog Ar, 


4. By Ando’s characterisation of the geometric mean if X is a Hermitian matrix and 


A X 
X B 


> 0, then A" < A^B. 


A 

-X ' 


' I 

0 

' A 

X ' 

' I o' 

-X 

B 


0 

-/ 

X 

B 

0 -I 


Since 
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we have 


A 

-X ' 

>0 if 

' A 

X ' 

-X 

B 


X 

B 


Hence ±X < A#B. Then by Ifl5l Lemma 2.1], |||A"||| < |||A#i?|||. In contrast to 
this, we do have that 


A 

A 1/2 B 1/ 2 ■ 


' A 1 / 2 0 ' 

' a x ' 2 B V 2 ' 

b 1 / 2 a 1 / 2 

B 


B 1 ' 2 0 

0 0 


but we have the opposite inequality |||A#I?||| < |||A 1 / 2 i? 1//2 |||. 

5. Among the several matrix versions of the arithmetic-geometric mean inequality 
proved by Bhatia and Kittaneh [TT3T1 one says that 4|||AB||| < |||(A + 5) 2 |||. Using 
this and Theorem |T| we have 


|||A 1/2 5 1 / 2 ||| < 




< 



(26) 


For t = 1/2 this extends the chain of inequalities ((131) in another direction. 

6. In a recent paper H2TH Matharu and Aujla have shown that 


A (A# t B) ^ log A (A'-'B*) . (27) 

For their proof they use the Furuta inequality. The inequality (fT8l) follows from 
this. As a corollary these authors observe that 


|||71#i/ 2j B||| < |||( J B 1 / 2 Afi 1 /2 ) i/2|||_ (28) 


In fact, from (|27l) one can deduce the stronger inequality (|24l) . By IX.2.10 in Q8D 
we have for A, B positive definite and 0 < t < 1 

III^A^IH < \\\(BABY\\\. 


So, the inequality (|24l) is stronger than (1281) . In turn, the latter inequality is 
stronger than one proved by T. Kosem [fT9l1 who showed 

|||(/1#i/ 2 B) 2 |||<|||B 1/ MB 1 /2|||. 


This follows from (l28l) because the majorisation x -< w y for positive vectors implies 

x 2 -< w y 2 - 


7. The third inequality in (TT3T) can be derived from the arithmetic-geometric mean 
inequality of Bhatia-Davis UTOl 

1 


|||A 1/2 XA 1/2 ||| < 


-(AX + XA) 
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valid for all A" and positive definite A. There are several refinements of this in¬ 
equality, some of which involve different means (Heinz means, logarithmic means, 
etc.) Each such result can be used to further refine (fT3l) . 
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